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Abstract — Feedback control of quantum mechanical systems 
must take into account the probabilistic nature of quantum 
measurement. We formulate quantum feedback control as a 
problem of stochastic nonlinear control by considering separately 
a quantum filtering problem and a state feedback control 
problem for the filter. We explore the use of stochastic Lyapunov 
techniques for the design of feedback controllers for quantum 
spin systems and demonstrate the possibility of stabilizing one 
outcome of a quantum measurement with unit probability. 

Index Terms — stochastic nonlinear control, quantum mechan- 
ics, quantum probability, quantum filtering, Lyapunov functions. 



I. Introduction 

IT IS a basic fact of nature that at small scales — at the level 
of atoms and photons — observations are inherently proba- 
bilistic, as described by the theory of quantum mechanics. The 
traditional formulation of quantum mechanics is very different, 
however, from the way stochastic processes are modeled. The 
theory of quantum measurement is notoriously strange in that 
it does not allow all quantum observables to be measured 
simultaneously. As such there is yet much progress to be 
made in the extension of control theory, particularly feedback 
control, to the quantum domain. 

One approach to quantum feedback control is to circumvent 
measurement entirely by directly feeding back the physical 
output from the system [1], [2]. In quantum optics, where 
the system is observed by coupling it to a mode of the 
electromagnetic field, this corresponds to all-optical feedback. 
Though this is in many ways an attractive option it is clear 
that performing a measurement allows greater flexibility in the 
control design, enabling the use of sophisticated in-loop signal 
processing and non-optical feedback actuators. Moreover, it is 
known that some quantum states obtained by measurement are 
not easily prepared in other ways [3]-[5]. 

We take a different route to quantum feedback control, 
where measurements play a central role. The key to this 
approach is that quantum theory, despite its entirely different 
appearance, is in fact very closely related to Kolmogorov's 
classical theory of probability. The essential departure from 
classical probability is the fact that in quantum theory observ- 
ables need not commute, which precludes their simultaneous 
measurement. Kolmogorov's theory is not equipped to deal 
with such objects: one can always obtain a joint probability 
distribution for random variables on a probability space, im- 
plying that they can be measured simultaneously. Formalizing 
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these ideas leads naturally to the rich field of noncommuta- 
tive or quantum probability [6]-[8]. Classical probability is 
obtained as a special case if we consider only commuting 
observables. 

Let us briefly recall the setting of stochastic control theory. 
The system dynamics and the observation process are usually 
described by stochastic differential equations of the Ito type. 
A generic approach to stochastic control [9], [10] separates 
the problem into two parts. First one constructs a filter which 
propagates our knowledge of the system state given all obser- 
vations up to the current time. Then one finds a state feedback 
law to control the filtering equation. Stochastic control theory 
has traditionally focused on linear systems, where the optimal 
(LQG) control problem can be solved explicitly. 

A theory of quantum feedback control with measurement 
can now be developed simply by replacing each ingredient of 
stochastic control theory by its noncommutative counterpart. 
In this framework, the system and observations are described 
by quantum stochastic differential equations. The next step is 
to obtain quantum filtering equations [11]— [14]. Remarkably, 
the filter is a classical Ito equation due to the fact that the 
output signal of a laboratory measuring device is a classical 
stochastic process. The remaining control problem now re- 
duces to a problem of classical stochastic nonlinear control. 
As in the classical case, the optimal control problem can be 
solved explicitly for quantum systems with linear dynamics. 

The field of quantum stochastic control was pioneered by 
V. P. Belavkin in a remarkable series of papers [11]— [13], [15] 
in which the quantum counterparts of nonlinear filtering and 
LQG control were developed. The advantage of the quantum 
stochastic approach is that the details of quantum probability 
and measurement are hidden in a quantum filtering equation 
and we can concentrate our efforts on the classical control 
problem associated with this equation. Recently the quantum 
filtering problem was reconsidered by Bouten et al. [14] and 
quantum optimal control has received some attention in the 
physics literature [16], [17]. 

The goal of this paper is twofold. We review the basic 
ingredients of quantum stochastic control: quantum probabil- 
ity, filtering, and the associated geometric structures. We then 
demonstrate the use of this framework in a nonlinear control 
problem. To this end, we study in detail an example directly 
related to our experimental apparatus [4]. As this is not a 
linear system, the optimal control problem is intractable and 
we must resort to methods of stochastic nonlinear control. 
We use stochastic Lyapunov techniques to design stabilizing 
controllers, demonstrating the feasibility of such an approach. 

We are motivated in studying the quantum control problem 
by recent developments in experimental quantum optics [4], 
[18]-[20]. Technology has now matured to the point that state- 
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of-the-art experiments can monitor and manipulate atomic 
and optical systems in real time at the quantum limit; i.e. 
the sources of extraneous noise are sufficiently suppressed 
that essentially all the noise is fundamental in nature. The 
experimental implementation of quantum control systems is 
thus within reach of current experiments, with important ap- 
plications in e.g. precision metrology [20]-[23] and quantum 
computing [24], [25]. Further development of quantum control 
theory is an essential step in this direction. 

This paper is organized as follows: in section [H] we give 
an introduction to quantum probability and sketch a simple 
derivation of quantum filtering equations. We also introduce 
the particular physical system that we study in the remainder 
of the paper. In section[n]]we study the dynamical behavior of 
the filtering equation and the underlying geometric structures. 
Finally, section IIVI is devoted to the design of stabilizing 
controllers using stochastic Lyapunov methods. 

II. Quantum probability and filtering 

The purpose of this section is to clarify the connections 
between quantum mechanics and classical probability theory. 
The emphasis is not on rigor as we aim for a brief but broad 
overview; we refer to the references for a complete treatment. 

A. Finite-dimensional quantum probability 

We begin by reviewing some of the traditional elements of 
quantum mechanics (e.g. [26]) with a probabilistic flavor. 

An observable of a finite-dimensional quantum system is 
represented by a self-adjoint linear operator X = X* on 
some underlying finite-dimensional complex Hilbert space Ti 
(* denotes Hermitian conjugation). Every self-adjoint operator 
has a spectral decomposition 



Pi = Pf = P* 



(1) 



where Xi are the eigenvalues of X and Pj are projectors onto 
orthogonal eigenspaces in Ti such that ^\ Pj = Id^. 

If we were to measure X we would obtain one of the values 
Xi as the measurement outcome. The Pi represent the events 
that can be measured. To complete the picture we still need a 
probability measure. This is provided by the density operator 
p, which is a linear operator on Ti satisfying 



Trp = 1, 



P > 0. 



The probability of an event Pi is given by 

Pi = Tr[pPj]. 
We can now easily find the expectation of X: 
(X)=Y,X l Tr{ P P l } = Tr{pX}. 



(2) 



(3) 



(4) 



In quantum mechanics p is also called the system state. 

As in classical probability, it will be useful to formalize 
these ideas into a mathematical theory of quantum probability 
[6]-[8]. The main ingredient of the theory is the quantum 
probability space (^4, p). Here A is a ^-algebra, i.e. an algebra 
with involution * of linear operators on Ti, and p is the 



associated state. An observable on (A, p) is a sum of the form 
with Pi 6 A. In the finite-dimensional case this implies 
that every observable is a member of A, but we will see that 
this need not be the case in infinite dimensions. 

A does not necessarily contain all self-adjoint operators 
on Ti. Of special importance is the case in which A is a 
commutative algebra, i.e. all the elements of A commute 
([X,Y] = XY - YX = VX,Y € A.) It is easily verified 
that there is a one-to-one correspondence (up to isomorphism) 
between commutative quantum probability spaces (^4, p) and 
classical probability spaces (O, J 7 , P) with cardO = dimTi. 
As A is commutative we may represent all its elements 
by diagonal matrices; the diagonals are then interpreted as 
functions / : £1 — > K. The projectors Pi £ A now correspond 
to indicator functions \Ai on £1 and hence define the cr-algebra 
T = {A,}. Finally, P is defined by P[A<] = Ti[P lP \. 

Clearly classical probability is a special case of quan- 
tum probability. However, noncommutative A are inherent to 
quantum mechanical models. Suppose A, B are two events 
(projectors) that do not commute. Then A and B cannot 
be diagonalized simultaneously, and hence they cannot be 
represented as events on a single classical probability space. 
Suppose we wish to measure A and B simultaneously, i.e. 
we ask what is the probability of the event (A and B)l In 
the classical case this would be given by the joint probability 
P[A,P] = ¥[A l~l B] = E[xaXb]- However in the noncom- 
mutative case this expression is ambiguous as Tr[pAP] ^ 
Tr[pPA]. We conclude that (A and B) is an invalid question 
and its probability is undefined. In this case, the events A and 
B are said to be incompatible. Similarly, two observables on 
A can be measured simultaneously only if they commute. 

We conclude this section with the important topic of con- 
ditional expectation. A traditional element of the theory of 
quantum measurement is the projection postulate, which can 
be stated as follows. Suppose we measure an observable X 
and obtain the outcome A^. Then the measurement causes the 
state to collapse: 

PipPi 



Tr[pP] 



(5) 



Suppose that we measure another observable X 1 after measur- 
ing X. Using we write 

Tr [pPiP^Pi, 



P[X' = \' j \X = X i }=Tr[P; P \ i } 



Tr[pP] 



(6) 



Now compare to the definition of conditional probability in 
classical probability theory: 

\BO A] 



[B\A] 



[A] 



A.B&T. 



(7) 



Clearly (J6J and @ are completely equivalent if X, X' com- 
mute. It is now straightforward to define the quantum analog 
of conditional expectation: 



£[X'\B] 



\- TrjpPX'P] 



(8) 



Here B is the ^-algebra generated by X, i.e. it is the algebra 
whose smallest projectors are Pj. This definition also coincides 
with the classical conditional expectation if X, X' commute. 
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We obtain ambiguous results, however, when X, X' do 
not commute, as then the fundamental property (£[X'\B]) = 
{X') is generally lost. This implies that if we measure an 
observable, but "throw away" the measurement outcome, the 
expectation of the observable may change. Clearly this is 
inconsistent with the concept of conditional expectation which 
only changes the observer's state of knowledge about the 
system, but this is not surprising: noncommuting X, X' cannot 
be measured simultaneously, so any attempt of statistical 
inference of X' based on a measurement of X is likely to 
be ambiguous. To avoid this problem we define the condi- 
tional expectation only for the case that X' commutes with 
every element of B. The measurement B is then said to be 
nondemolition [11] with respect to X'. 

The essence of the formalism we have outlined is that the 
foundation of quantum theory is an extension of classical 
probability theory. This point of view lies at the heart of quan- 
tum stochastic control. The traditional formulation of quantum 
mechanics can be directly recovered from this formalism. Even 
the nondemolition requirement is not a restriction: we will 
show that the collapse rule l|5} emerges in a quantum filtering 
theory that is based entirely on nondemolition measurements. 

B. Infinite-dimensional quantum probability 

The theory of the previous section exhibits the main 
features of quantum probability, but only allows for finite- 
state random variables. A general theory which allows for 
continuous random variables is developed along essentially 
the same lines where linear algebra, the foundation of finite- 
dimensional quantum mechanics, is replaced by functional 
analysis. We will only briefly sketch the constructions here; a 
lucid introduction to the general theory can be found in [6]. 

A quantum probability space (A, p) consists of a Von 
Neumann algebra A and a state p. A Von Neumann algebra is 
a ^-algebra of bounded linear operators on a complex Hilbert 
space H and p : A — > C is a linear map such that p(Id-n) = 1, 
p(A*A) > VA e A and p(A*A) = iff A = 0. We gloss 
over additional requirements related to limits of sequences 
of operators. It is easily verified that the definition reduces 
in the finite-dimensional case to the theory in the previous 
section, where the density operator p is identified with the 
map X i— > Tr[/oX]. We always assume Id-j-c € A. 

As in the finite-dimensional case there is a correspon- 
dence between classical probability spaces and commutative 
algebras. Given the classical space (CI, P) the associated 
quantum probability space is constructed as follows: 

H = L 2 (n;C), A = L™(n;C), p:f^ffdP(9) 

Jn 

where A acts on H by pointwise multiplication. Conversely, 
every commutative quantum probability space corresponds to 
a classical probability space. This fundamental result in the 
theory of operator algebras is known as Gel'fand's theorem. 

Observables are represented by linear operators that are 
self-adjoint with respect to some dense domain of Ti. The 
spectral decomposition Q is now replaced by the spectral 



theorem of functional analysis, which states that every self- 
adjoint operator X can be represented as 

X=[\E(d\), E:B R ^V{H). (10) 

Here E is the spectral or projection- valued measure associated 
to X, V(H) is the set of all projection operators on fi, and Br 
is the Borel cr-algebra on R. X is affiliated to A if E(A) e A 
VA € 23r, replacing the concept of measurability in classical 
probability theory. For X affiliated to A the probability law 
and expectation are given by 

P[X e A] = p(E(A)), (X) = J Xp(E(dX)). (11) 

Note that unlike in finite dimensions not all observables affil- 
iated to A are elements of A; observables may be unbounded 
operators, while A only contains bounded operators. 

It remains to generalize conditional expectations to the 
infinite-dimensional setting, a task that is not entirely straight- 
forward even in the classical case. Let B C A be a commuta- 
tive Von Neumann subalgebra. As before we will only define 
conditional expectations for observables that are not demol- 
ished by B, i.e. for observables affiliated to the commutant 
B' = {A e A : [A, B] = VB e B}. 

Definition 1: The conditional expectation onto B is the 
linear surjective map £[-|i3] : B 1 — > B with the following 
properties: for all A £ B' 

1) £[ld H \B} = U H , 

2) £[A\B] > if A > 0, 

3) £[5iAB 2 |£] = Bt.£[A\B\B 2 VBi,B 2 G B, and 

4) p{£[A\B]) = p{A). 

The definition extends to any observable X affiliated to B' by 
operating £[-|i3] on the associated spectral measure. 

It is possible to prove (e.g. [14]) that the conditional 
expectation exists and is unique. 

C. Quantum stochastic calculus 

Having extended probability theory to the quantum setting, 
we now sketch the development of a quantum Ito calculus. 

We must first find a quantum analog of the Wiener process. 
Denote by (f2, T , P) the canonical Wiener space of a classical 
Brownian motion. The analysis in the previous section sug- 
gests that quantum Brownian motion will be represented by a 
set of observables on the Hilbert space V = L 2 (Cl;C). Define 
the symmetric Fock space over L 2 (U) as 

oo 

T S {L 2 (U)) = Ce0i 2 (C/;C) 0n , UcR+ (12) 

n=l 

where denotes the symmetrized tensor product. It is well 
known in stochastic analysis (e.g. [8]) that V and r s (L 2 (M + )) 
are isomorphic, as every i 2 -functional on fl is associated to 
its Wiener chaos expansion. Now define the operators 

n . 

A„k = hi © • • • ki • • • k n / g*h dt 
9 h Jw + (13) 

A* g k = g ki • • • © k n 
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where k = kx • • • k n , g, ki G L 2 (R + ) and fc, means that 
the term % is omitted. It is sufficient to define the operators for 
such vectors as their linear span To is dense in T. We get 

[A B ,A h ] = [A*,A%] = 0, [A g ,A* h ] = f g*hdt (14) 

and indeed (v, A g w) = {A*v, w) for v, w G Tq. 

We will construct Wiener processes from A and A*, but 
first we must set up the quantum probability space. We take 
A to contain all bounded linear operators on T. To construct 
p consider the vector A = 1 © <E r s (L 2 (R+)). Then 



p : A 



p(X) = (A,XA). 



Now consider the operator A*+A g . Using (II 41 and the Baker- 
Campbell-Hausdorff lemma we obtain 



AK+Ag)\ _ 



(A,e 



'A) 



(16) 



where ||<7|| 2 is the integral of \g\ 2 over R + . However, the 
characteristic functional of a classical Wiener process is 

E [ e ij °°~9(t)dw t] = e -m\ 2 ( i7) 

where g is a real function. Clearly A* + A g is equivalent in 
law to a classical Wiener integral, and any Q t — A* + A gt 
with gt{s) = X[o,t]{ s ) e lip ^ is a quantum Wiener process. 

It is easy to verify that [Q t , Q s ] = Vi, s. This important 
property allows us to represent all Q t ,tE K + on a single clas- 
sical probability space, and hence Q t is entirely equivalent to 
a classical Wiener process. Two such processes with different 
(p do not commute, however, and are thus incompatible. 

The Fock space dl 2t i has the following factorization prop- 
erty: for any sequence of times t\ < t<i <...<*„ G M. + 



r = r tl] ®r tllta 



(18) 



with r M = r s (L 2 ([ s ,t})), r t] = r 0>t and r [t = r ti00 . Thus r 

can be formally considered as a continuous tensor product over 
T s (L 2 ({t})), a construction often used implicitly in physics 
literature. A process St is called adapted if St — S t ] Id in 
r t ] T[ t for every t G R+. Qt is adapted for any cp. 
It is customary to define the standard noises 



At — A X[0tt] , 



A* — A* 



t G 



(19) 



One can now define Ito integrals and calculus with respect to 
At , A* in complete analogy to the classical case. We will only 
describe the main results, due to Hudson and Parthasarathy 
[27], and refer to [7], [8], [27] for the full theory. 

Let H be the Hilbert space of the system of interest; we 
will assume that dim7i < oo. Now let A be the set of all 
bounded operators on H <£> T. The state p = p^ pr is given 
in terms some state pn on H and pr as defined in (I15> . The 
Hudson-Parthasarathy equation 



Id+ f (LdA* t -L* dA t -{iH+\L*L)dt)U s , t (20) 

J s 

defines the flow U s t of the noisy dynamics. Here L and H are 
operators of the form L ® Id on H ® T and H is self-adjoint. It 
can be shown that U s> t is a unitary transformation of Ti ®T S:t 



and U s .t — Ut.tU s ,k- Given an observable S at time 0, the 
flow defines the associated process St — Uq ( SC/o,t- 

Quantum stochastic differential equations are easily ma- 
nipulated using the following rules. The expectation of any 
integral over dA t or dA* vanishes. The differentials dA t , dA^ 
commute with any adapted process. Finally, the quantum Ito 
rules are dA t dA; = dt, dA 2 = (dA* t ) 2 = dA; dA t = 0. 

Let X G TL be any system observable; its time evolution is 
given by jt(X) = U^ t {X ® ld)U . t - We easily obtain 

djt(X) = jt{CX)dt+j t ([L* ,X])dA t +3t{[X,L])dA$ (21) 



(15) where CX = i[H,X] + L*XL - \{L*LX + XL*L). This 



expression is the quantum analog of the classical Ito formula 



dj t (f)=jt(J?f)dt + jt(Zf)dW t 



(22) 



where jt{f) = f(%t) with dx t — b(x t ) dt + a{x t ) dW t , S£ is 
the infinitesimal generator of xt and E/ = cr l dif . Similarly, 
C is called the generator of the quantum diffusion U s ,t- 

In fact, the quantum theory is very similar to the classi- 
cal theory of stochastic flows [28], [29] with one notable 
exception: the existence of incompatible observables does 
not allow for a unique sample path interpretation (x t in the 
classical case) of the underlying system. Hence the dynamics 
is necessarily expressed in terms of observables, as in J2 1 1 . 

D. Measurements and filtering 

We now complete the picture by introducing observations 
and conditioning the system observables on the observed 
process. The following treatment is inspired by [12], [13]. 

1 ) Classical filtering: To set the stage for the quantum fil- 
tering problem we first treat its classical counterpart. Suppose 
the system dynamics J22I is observed as y t with 



dyt = jt(h) dt + ndV t 



(23) 



for uncorrected noise V t with strength n > 0. We wish to 
calculate the conditional expectation tvt(f) = ^\jt{f)\^t\- 

Recall the classical definition: E[X|JF] is the JF-measurable 
random variable such that E[E[X|.F]Y] = E[XY] for all 
jF-measurable Y, Suppose T is generated by some random 
variable F, The definition suggests that to prove X — E[X|.F] 
for some JF-measurable X, it should be sufficient to show that 



E[Xe n ] = E[Xe n ] G K, 



(24) 



i.e., the conditional generating functions coincide. 

We will apply this strategy in the continuous case. As 7r t (/) 
is an J^-semimartingale we introduce the ansatz 



dirt{f)=Ctdt + D t dy t 



(25) 



with C t ,D t ^f-adapted. We will choose C t ,D t such that 
E[ef-7r t (/)] = E[efjt(/)] for all functions g, where 



of _ Jo g(s)dy s -hK 2 J* g(s) 2 ds 



de 9 t = g(t)e 9 t dy t . (26) 



The Ito correction term in the exponent was chosen for 
convenience and does not otherwise affect the procedure. 
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Using Ito's rule and the usual properties of conditional 
expectations we easily obtain 

dE[efj t (f)} 



— W\ c 3 



dt 

gE[e|7rt(/)] 
dt 



Tr t (J?f)+g(ty t Mhf)} 



- E[ef(C t + nt(h)D t ) 



(27) 



(28) 



+ g(ty t (K 2 D t + Tr t (h)n t (f))]. 
Requiring these expressions to be identical for any g gives 



dw t {f)=n t (&f)dt- 



[ (ir t (hf)-7r t (h)ir t (f))dW t (29) 



where the innovations process dWt — K _1 (c?y t — ir t (h) dt) 
is a Wiener process. Eq. \29\ is the well-known Kushner- 
Stratonovich equation of nonlinear filtering [30], [31]. 

2) Quantum filtering: The classical approach generalizes 
directly to the quantum case. The main difficulty here is how 
to define in a sensible way the observation equation d23i ? 

We approach the problem from a physical perspective [32]. 
The quantum noise represents an electromagnetic field coupled 
to the system (e.g. an atom.) Unlike classically, where any 
observation is in principle admissible, a physical measurement 
is performed by placing a detector in the field. Hence the same 
noise that drives the system is used for detection, placing a 
physical restriction on the form of the observation. 

We will consider the observation Y{ = UQ t (A^ + A t )U .t + 
k{B\ '+B t ). Here B t is a noise uncorrected from A t that does 
not interact with the system (the Hilbert space is Ti <g) T <g) T, 
etc.) Physically we are measuring the field observable A^+A t 
after interaction with the system, corrupted by uncorrelated 
noise of strength k > 0. Using the Ito rule and J20i we get 



dYl = j t (L* +L)dt + dA* t + dA t + n{dB; + dB t ). (30) 

It is customary in physics to use a normalized observation Y t 
such that dY 2 — dt. We will use the standard notation 



dY t = 0j (j t (L* + L)dt + dA* t +dA t ) + ^T^dVt (31) 

where V t = B* + B t and r\ = (1 + k 2 )- 1 S (0, 1]. 
Y( and Y t satisfy the following two crucial properties: 

1) Y( is self-nondemolition, i.e. [Y/ , Y[\ ~ Vs < t. To see 
this, note that [Y/.Y/] = [Ul t Q t U ,u U^ s Q s U 0tS ] with 
Qt = A% + At. But U s .t is a unitary transformation of 
H®T s>t and Q s = ld®Q s ]®ld on H®r s ]®r[ s ; thus we 
get U* it Q a Us,t = Q s U* st U s . t = Q s , so U$ s Q s U 0iS = 
US >t Q a U 0>t . But then [Y{X] = Uo t t[QuQ»\Uo,t = 
as we have already seen that Q t is self-nondemolition. 

2) Y"/ is nondemolition, i.e. [j t (X), Y[\ = Vs < t for all 
system observables X on H. The proof is identical to 
the proof of the self-nondemolition property. 

These properties are essential in any sensible quantum filtering 
theory: self-nondemolition implies that the observation is a 
classical stochastic process, whereas nondemolition is required 
for the conditional expectations to exist. A general filtering 
theory can be developed that allows any such observation [11], 
[12]; we will restrict ourselves to our physically motivated Y t . 

We wish to calculate TTt(X) = £[j t (X)\Bt] where Bt is the 
algebra generated by Y s < t - Introduce the ansatz 




Beam splitter 



Fig. 1. Schematic of an experiment for continuous quantum measurement 
and control. The spin interacts with an optical mode, which is measured 
continuously by homodyne detection. A magnetic field is used for feedback. 



where Ct,D t are affiliated to Bt- Define 

^e/o's^.-i/o's^^ de 9 t = g(t)e 9 t dY t . (33) 
Using the quantum Ito rule and Def . ^ we g et 



d(e g t jt(X)) 
dt 

dt 



= (e?ir t (£X)+ 

g(ty t Tr t (XL + L*X)^) 

= (e 9 t (C t +ir t (L* +L)D t ^j) + 



(34) 



(35) 



dir t {X) = C t dt + D t dY t 



(32) 



g(t)ef(D t + ir t (L* + L)ir t (X)^n)). 
Requiring these expressions to be identical for any g gives 

dn t (X) = n t {CX) dt + ^Tj{7r t (XL + L*X) 

- ML* + L)n t (X))(dY t - y/vML* + L) dt) (36) 

which is the quantum analog of j29l >. It can be shown that 
the innovations process dWt = dY t — ^/rjn t {L* + L) dt is 
a martingale (e.g. [14]), and hence it is a Wiener process by 
Levy's classical theorem. 

E. The physical model 

Quantum (or classical) probability does not by itself de- 
scribe any particular physical system; it only provides the 
mathematical framework in which physical systems can be 
modeled. The modeling of particular systems is largely the 
physicist's task and a detailed discussion of the issues involved 
is beyond the scope of this article; we limit ourselves to 
a few general remarks. The main goal of this section is to 
introduce a prototypical quantum system which we will use 
in the remainder of this article. 

The emergence of quantum models can be justified in differ- 
ent ways. The traditional approach involves "quantization" of 
classical mechanical theories using an empirical quantization 
rule. A more fundamental theory builds quantum models as 
"statistical" representations of mechanical symmetry groups 
[33], [34]. Both approaches generally lead to the same theory. 
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The model considered in this paper (Fig. ^ is prototypical 
for experiments in quantum optics; in fact, it is very similar to 
our laboratory apparatus [4]. The system consists of a cloud of 
atoms, collectively labeled "spin", interacting with an optical 
field (along z) produced by a laser. After interacting with the 
system the optical field is detected using a photodetector con- 
figuration known as a homodyne detector. A pair of magnetic 
coils (along y) are used as feedback actuators. 

The optical and magnetic fields are configured so they 
only interact, to good approximation, with the collective 
angular momentum degrees of freedom of all the atoms 
[35]. Rotational symmetry implies that observables of angular 
momentum must form the rotation Lie algebra so(3). If we 
impose additionally that the total angular momentum is con- 
served, then it is a standard result in quantum mechanics [26] 
that the angular momentum observables form an irreducible 
representation of so (3). Such a system is called a spin. 

We take H to be the spin Hilbert space. Any finite dimension 
2 < dim7i < oo supports an irrep of so(3); the choice 
of dim7i = 2j + 1 depends on the number of atoms and 
their properties. We can choose an orthonormal basis {ip m £ 
H, to = —j, — j + 1, . . . ,j} such that the observables J x , y ,z 
of angular momentum around the x, y, z-axis are defined by 1 

Jx^Pm CmV'm+l ^ C— m^Pm — 1 

Jy4>m = iCmi'm+1 ~ iC- m lp m -i (37) 
Jzi>m = mip m 

with c m = \\J {j — rn){j + m + 1). It is easily verified that 
Jx, y ,z indeed generate so(3), e.g. [J x , J y ] = iJ z . 

Note that J x ,y,z are discrete random variables; the fact 
that angular momentum is "quantized", unlike in classical 
mechanics, is one of the remarkable predictions of quantum 
mechanics that give the theory its name. Another remarkable 
non-classical effect is that J XViZ are incompatible observables. 

The noise in our model and its interaction with the atoms 
emerges naturally from quantum electrodynamics, the quan- 
tum theory of light [36]. Physical noise is not white; however, 
as the correlation time of the optical noise is much shorter than 
the time scale of the spin dynamics, a quantum analog of the 
classical Wong-Zakai procedure [37], [38] can be employed 
to approximate the dynamics by an equation of the form J20i . 
In fact, the term — \L*L in ( 1201 is precisely the Wong-Zakai 
correction term that emerges in the white noise limit. 

We now state the details of our model without further 
physical justification. The system is described by ( I20> with 
L = \[M J z and H = B(t) J y . Here M > is the strength of 
the interaction between the light and the atoms; it is regulated 
experimentally by the optical cavity. B(t) is the applied mag- 
netic field and serves as the control input. Finally, homodyne 
detection [32] provides exactly the measurement 2 13 H . where 
■q is determined by the efficiency of the photodetectors. 

'Angular momentum is given in units of h ~ 1.055 X 10 -34 kgm 2 s — 1 . 
To simplify the notation we always work in units such that h = 1. 

-In practice one measures not Yt but its formal derivative I(t) = dY t /dt. 
As in classical stochastics we prefer to deal mathematically with the integrated 
observation Yt rather than the singular "white noise" photocurrent /(f). 



In the remainder of the paper we will study the spin system 
°f Fig- HI Before we devote ourselves entirely to this situation, 
however, we mention a couple of other common scenarios. 

Often L is not self-adjoint; in this case, the system can 
emit or absorb energy through interaction with the field. This 
situation occurs when the optical frequency of the cavity field 
is resonant with an atomic transition. In our case the frequency 
is chosen to be far off-resonant; this leads to self-adjoint L 
after adiabatic elimination of the cavity dynamics (e.g. [16]). 
The filter dynamics in this scenario, to be described below, 
is known as state reduction. The sequence of approximations 
that is used for our particular model is described in [39]. 

Finally, a different detector configuration may be chosen. 
For example, a drastically different observation, known as 
photon counting, gives rise to a Poisson (jump) process. We 
refer to [32] for a full account of the quantum stochastic 
approach to observations in quantum optics. 

III. Geometry and dynamics of the filter 

In the previous section we introduced our physical model. A 
detailed analysis resulted in the filtering equation d36i . where 
TTt(X) is the best estimate of the observable X given the 
observations Y s < t - We will now study this equation in detail. 

Note that (1361 is driven by the observation Y t , which is a 
classical stochastic process. Hence (I36> is entirely equivalent 
to a classical Ito equation. This is an important point, as 
it means that in the remainder of this article we only need 
classical stochastic calculus. 

A. The state space 

We begin by investigating the state space on which the filter 
evolves. Clearly d36l defines the time evolution of a map -n t \ 
we will show how this map can be represented efficiently. 

The map 7r f associates to every observable X on Ti a 
classical stochastic process which represents the expectation 
of X conditioned on the observations up to time t. It is easily 
verified that 7r t is linear, identity-preserving, and maps positive 
observables to positive numbers: in fact, it acts exactly like the 
expectation of X with respect to some finite-dimensional state 
on Ti. We will denote this state by p t , the conditional density 
at time t, where by definition TTt(X) = Tr[p t X]. 

It is straightforward to find an expression for p t . We get 

dp t = £* ptdt+^j{Lp t +ptL*-Tr[pt(L+L*)] P t) dW t (38) 

with the innovations dW t = dY t — y/rjTr[p t (L+L*)} dt and the 
adjoint generator C* p — —i[H, p] + LpL* — \(L* Lp + pL* L). 
In physics this equation is also known as a quantum trajectory 
equation or stochastic master equation. 

Let dim7Y = n; as n is finite, we can represent linear 
operators on Ti by complex matrices. Thus J38l > is an ordinary, 
finite-dimensional Ito equation. We saw in section llLAl that p t 
is a density matrix, i.e. it belongs to the space 

V = {p £ C nxn :p = p\ Trp = 1, p > 0}. (39) 

By construction V is an invariant set of ( I38l l. and forms the 
natural state space of the filter. 
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B. Geometry of V 

The geometry of V is rather complicated [40]. To make the 
space more manageable we will reparametrize V so it can be 
expressed as a semialgebraic set. 

Let us choose the matrix elements p^ of p as follows. For 
i > j set p^ = Xij + ipij with Ajj,/i,j G M. For i < j set 
Pij = p*i- Finally, choose an integer k between 1 and n. For 
i 7^ k set pa = Vi, Vi <E R, and p kk = 1 — Y^t^k u i- Collect 
all n 2 — 1 numbers Ay , p^ , into a vector A. Then clearly 
the map h : A i— > p is an isomorphism between K™ _1 and 
{p€ C" x " : p= p*, Tip = 1}. 

It remains to find the subset K C R" ~ 1 that corresponds 
to positive definite matrices. This is nontrivial, however, as it 
requires us to express nonnegativity of the eigenvalues of p as 
constraints on p^. The problem was solved by Kimura [40] 
using Descartes' sign rule and the Newton-Girard identities 
for symmetric polynomials; we quote the following result: 

Proposition 1: Define k p (p), p = 2 . . . n recursively by 

Pk p (p) = Et- 1 )^ 1 K-M (4°) 

q=l 

with fco = ki = 1. Define the semialgebraic set 

K = {A G M™ 2-1 : k p (h(A)) > 0, p = 2...n}. (41) 

Then h is an isomorphism between K and P. 

Note that 2fc 2 = 1 - Tr[p 2 ] > implies ||A|| 2 = £\ ^ + 
£*>j( A ?j + Mij) < Tr[/i(A) 2 ] < 1. Hence K is compact. 

We work out explicitly the simplest case n — 2 (spin j = |). 
Set pn = iv, p 2 2 = 1 — v, p 2 i = A + i/i = p| 2 . Then 

#2 = {A = (A, p, v) G R 3 : A 2 + p 2 + 1/(1/ - 1) < 0}. (42) 

This is just a solid sphere with radius i, centered at (0, 0, 5). 
The case n = 2 is deceptively simple, however: it is the only 
case with a simple topology [40], [41]. 

We can also express d38l in terms of A. Specifically, we 
will consider the spin system L = sfM J z , H = B(t) J y in 
the basis V1/2 = (1,0), V-1/2 = (0, 1) on C 2x2 . We obtain 

d\ t = {B{t){v t - |) - |MA t ) dt + y/Mr)X t {l - 2v t ) dW t 
dp t = -\Mp t dt + x/M^(l - 2v t ) dW t 
dv t = -B(t)\ t dt - 2^/Uh)Vt(v t - 1) dW t . (43) 
By construction, K2 is an invariant set for this system. 

C. Convexity and pure states 

Just like its classical counterpart, the set of densities V is 
convex. We have the following fundamental result: 

Proposition 2: The set V is the convex hull of the set of 
pure states Q = {vv* € C nxn : v G C n , \\v\\ = 1} C V . 

Proof: As any p G V is self-adjoint it can be written as 
p = Aif if ,*, where Vi are orthonormal eigenvectors of p 
and A, are the corresponding eigenvalues. But Trp =1, p > 
imply that J^Aj = 1 and Aj G [0,1]. Hence V C conv Q. 
Conversely, it is easily verified that conv Qc?. ■ 

Pure states are the extremal elements of V\ they represent 
quantum states of maximal information. Note that classically 



extremal measures are deterministic, i.e. P[A] is either or 
1 for any event A. This is not the case for pure states p = 
vv*, however: any event A = ww* with < < 1, 

||ui|| = 1 will have < Tr[pA] < 1. Thus no quantum state is 
deterministic, unless we restrict to a commutative algebra A. 

Intuitively one would expect that if the output Y t is not 
corrupted by independent noise, i.e. r\ = 1, then there is no loss 
of information, and hence an initially pure po would remain 
pure under J38i . This is indeed the case. Define 

dv t = \{h t L -\L*L-\h 2 t -iH)dt+{L-h t ) dW t ] v t (44) 

where h t = i« ( *(L* + L)vt- Then it is easily verified that 
Pt — Vtv'£ obeys (1381 with 77 = 1. It follows that if r\ = 1, Q 
is an invariant set of (1381 . In the concrete example ( I43> it is 
not difficult to verify this property directly: when 7/ = 1, the 
sphere A 2 + p? + v(y — 1) = is invariant under (143 \ . 

D. Quantum state reduction 

We now study the dynamics of the spin filtering equation 
without feedback B{t) = 0. We follow the approach of [42]. 
Consider the quantity Vt = nt(J 2 ) — TTt(J z ) 2 - We obtain 

^M = -4Mr,E^ 2 ]. (45) 
dt 

Clearly E[V t 2 ] > 0, so Ep4] decreases monotonically. But 
V t > and E[V t 2 ] = iff = a.s. We conclude that 

lim E[V t ] = (46) 

t — >oo 

and hence Vt — > a.s. as t — > 00. But the only states p G 
V with V t = Tr[J 2 p] - Tr[J^p] 2 = are the eigenstates 
V'mV'm °f Jz- Hence in the long-time limit the conditional 
state collapses onto one of the eigenstates of J z , as predicted 
by (0 for a "direct" measurement of J z . 

With what probability does the state collapse onto eigenstate 
to? To study this, let us calculate irtfymipm)- We get 

d7T t (V> m VC) = 2^/M^TT t {^ m ^* m ){m-TT t {J z ))dW t . (47) 

Clearly irt(ipm"4>m) i s a martingale, so 

Prn = E[7T 00 (V' TO V , m)] = TofcC)' ( 4 8) 

We have already shown that p^ is one of V'nV'n' an d as 
the ip m are orthonormal this implies that TT Qa {4'mi } * n ) = 
r ^ I [Poo' t Pmipm] is 1 if 72 = to and otherwise. Thus p m is 
just the probability of collapsing onto the eigenstate m. But 
note that 7r (?/'m'!/ ; m) = Tr[poV'mV , m]> so <l48i gives exactly the 
same collapse probability as the "direct" measurement (|3j- 

We conclude that the predictions of quantum filtering theory 
are entirely consistent with the traditional quantum mechanics. 
A continuous reduction process replaces, but is asymptotically 
equivalent to, the instantaneous state collapse of section IH-AI 
This phenomenon is known as quantum state reduction 7 '. We 

3 The term state reduction is sometimes associated with quantum state 
diffusion, an attempt to empirically modify the laws of quantum mechanics so 
that state collapse becomes a dynamical property. The state diffusion equation, 
which is postulated rather than derived, is exactly 1441 with L = L* . We use 
the term state reduction as describing the reduction dynamics without any 
relation to its interpretation. The analysis of Ref. [42] is presented in the 
context of quantum state diffusion, but applies equally well to our case. 
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System: Eq. (21) 



Y t : Eq.(31] 



B(t) 



Digital processing 



Controller 



Filter: Eq.(38) 



Fig. 2. Schematic of the feedback control strategy. The output from the 
system is used to propagate the conditional state of the filter. The feedback 
signal is of state feedback form with respect to the conditional state. 



emphasize that quantum filtering is purely a statistical infer- 
ence process and is obtained entirely through nondemolition 
measurements. Note also that state reduction occurs because 
L = J z is self-adjoint; other cases are of equal physical 
interest, but we will not consider them in this paper. 

Physically, the filtering approach shows that realistic mea- 
surements are not instantaneous but take some finite time. The 
time scale of state reduction is of order M , an experimen- 
tally controlled parameter. A carefully designed experiment 
can thus have a reduction time scale of an order attainable by 
modern digital electronics [43], which opens the door to both 
measuring and manipulating the process in real time. 

IV. Stabilization of spin state reduction 
A. The control problem 

It is a standard idea in stochastic control that an output feed- 
back control problem can be converted into a state feedback 
problem for the filter [9], [10]. This is shown schematically 
in Fig. |2] The filtering equations d36i or J38l > are driven by 
Y t ; hence, at least in principle, the conditional state p t can be 
calculated recursively in real time by a digital processor. 

The filter describes optimally our knowledge of the system; 
clearly the extent of our knowledge of the system state limits 
the precision with which it can be controlled. The best we 
can hope to do is to control the system to the best of our 
knowledge, i.e. to control the filter. The latter is a well-posed 
problem, despite that we cannot predict the observations Y t , 
because we know the statistics of the innovations process Wt- 

For such a scheme to be successful the system dynamics 
(121 1 must be known, as the optimal filter is matched to the 
system dynamics. Designing controllers that perform well 
even when the system dynamics is not known precisely is 
the subject of robust control theory. Also, efficient signal 
processing algorithms and hardware are necessary to propagate 
(1381 in real time, which is particularly problematic when 
dim7i is large. Neither of these issues will be considered in 
this paper. 

The state reduction dynamics discussed in the previous 
section immediately suggests the following control problem: 
we wish to find state feedback B(t) — $>(pt) so that one of 
the eigenstates p = f/'mV'm is globally stabilized. The idea 
that a quantum measurement can be engineered to collapse 
deterministically onto an eigenstate of our choice is somewhat 
remarkable from a traditional physics perspective, but clearly 
the measurement scenario we have described provides us 
with this opportunity. For additional motivation and numerical 
simulations relating to this control problem, see [3], 



B. Stochastic stability 

In nonlinear control theory [44] stabilization of nonlinear 
systems is usually performed using the powerful tools of 
Lyapunov stability theory. In this section we will describe 
the stochastic counterpart of deterministic Lyapunov theory, 
developed in the 1960s by Has'minskii and others. We will 
not give proofs, for which we refer to [45]-[48]. 

Let Wt be a Wiener process on the canonical Wiener space 
(f2, P). Consider an Ito equation on R™ of the form 



dx t = b(x t ) dt + a(x t ) dW t 



(49) 



where b, a : M™ — * M™ satisfy the usual linear growth and local 
Lipschitz conditions for existence and uniqueness of solutions 
[49]. Let x* be a fixed point of g9}, i.e. b(x*) = a{x*) = 0. 

Definition 2: The equilibrium solution x t = x* of (149 \ is 

1) stable in probability if 



lim 

xn — >x* 



sup \Xt — x | > e 

t>0 



= Ve > 0, 
2) asymptotically stable if it is stable in probability and 



lim 

Xq — >x* 



lim \xt 







1. 



3) globally stable if it is stable in probability and 



lim \xt 







1. 



Note that 1 and 2 are local properties, whereas 3 is a global 
property of the system. 

Recall that the infinitesimal generator of xt is given by 

d_ 1^;,,;,, d 2 
dx l 



(50) 



so dE[f(xt)]/dt = E[Jif f (xt)]. We can now state the stochas- 
tic equivalent of Lyapunov's direct method [45]-[47]. 

Theorem 1: Define Uh — {x : \x — x*\ < h}. Suppose 
there exists some h > and a function V : Uh — * R+ that 
is continuous and twice differentiable on Uh\{x*}, such that 
V(x*) = and V(x) > otherwise, and ££V{x) < on U h . 
Then the equilibrium solution xt = x* is stable in probability. 
If ££V{x) < on Uh\{x*} then x* is asymptotically stable. 

Theorem [0 is a local theorem; to prove global stability we 
need additional methods. When dealing with quantum filtering 
equations a useful global result is the following stochastic 
LaSalle-type theorem of Mao [48]. In the theorem we will 
assume that the dynamics of ( 149 \ are confined to a bounded 
invariant set G. 

Theorem 2: Let G be a bounded invariant set with respect 
to the solutions of \A9\ and xq € G. Suppose there exists a 
continuous, twice differentiable function V : G — » R+ such 
that &V(x) < Vx G G. Then lim^^ &V{x t ) = a.s. 

Finally, we will find it useful to prove that a particular fixed 
point repels trajectories that do not originate on it. To this end 
we use the following theorem of Has'minskii [45]. 

Theorem 3: Suppose there exists some h > and a func- 
tion V : Uh — > M that is continuous and twice differentiable 
on Uh\{x*}, such that 

lim V(x) = +oo 
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Fig. 3. Cartoons of the various control schemes; the arrows denote the rotation direction of the magnetic field, (a) Almost global control on the circle: the 
magnetic field always rotates in the direction of least distance to 9 = n, but 9 = remains a fixed point, (b) Global control on the circle: we intentionally 
break the symmetry of the controller to remove the undesired fixed point. The graphs show a typical feedback law and Lyapunov design with M = 1, 
B(9) = | sin6* — |(1 + cos 9), V(9) = (§ + sin0)(l + cos 9). (c) A neighborhood of (A, v) = (0, 1) showing why the almost global control law fails 
on the disc. The control vanishes on the line A = 0; hence points on this line are never repelled with unit probability, in violation of 1511 . 



and S£V{x) < on Uh\{x*}. Then the equilibrium solution 
x t = x* is not stable in probability, and moreover 



sup \ xt — x 
t>o 



< h 



= 



Vx G U h \{x*}. (51) 



C. A toy problem: the disc and the circle 

We treat in detail an important toy problem: spin j = h- 
The low dimension and the simple topology make this problem 
easy to visualize. Nonetheless we will see that the stabilization 
problem is not easy to solve even in this simple case. 

We have already obtained the filter d43t on Ki for this 
case. Conveniently, the origin in K-i is mapped to the lower 
eigenstate V'-i/2' i / ; * xlv we w ^ attempt to stabilize this state. 

Note that the equations for \t,vt are decoupled from \i t . 
Moreover, the only point in with (A, v) = (0, 0) has /i = 0. 
Hence we can equivalently consider the control problem 



d\ t = (B{t){v t -\)-\M\ t )dt- 
dv t = -B(t)X t dt - 2y/Mrji/ t (vt 



jMr)\t{\-2 Vt )dW t 
l)dW t (52) 

on the disc B 2 = {(A, v) G R 2 : A 2 + v{y - 1) < 0}. 
Controlling J52I is entirely equivalent to controlling j43l >. 
as globally stabilizing (A, v) = (0, 0) guarantees that fi is 
attracted to zero due to the geometry of K2- 

An even simpler toy problem is obtained as follows. Sup- 
pose T) = 1; we have seen that then the sphere A 2 + /i 2 + 
v(y — 1) = is invariant under d43l >. Now suppose that 
additionally fj,Q = 0. Then clearly the circle S 1 — {(A, v) G 
R 2 : A 2 + v(y — 1) = 0} is an invariant set. We find 

d0 t = {B{t) - |Msin0 t cos0 t ) dt - y/M sin0 t dW t (53) 

after a change of variables (2Af, 2v t ) — (sm9 t , 1 + cos0 t ). 

The system (I52> could in principle be realized by per- 
forming the experiment of Fig. ^ with a single atom. The 
reduced system J53i is unrealistic, however: it would require 
perfect photodetectors and perfect preparation of the initial 
state. Nonetheless it is instructive to study this case, as it 
provides intuition which can be applied in more complicated 
scenarios. Note that (1531 is a special case of ( I52t where rj = 1 
and the dynamics is restricted to the boundary of B 2 . 



D. Almost global control on S 1 

We wish to stabilize (A, v) = (0, 0), which corresponds to 
= 7r. Note that by J53i a positive magnetic field B > 
causes an increasing drift in 8, i.e. a clockwise rotation on 
the circle. Hence a natural choice of controller is one which 
causes the state to rotate in the direction nearest to 9 = it from 
the current position. This situation is sketched in Fig. |3jt. 

A drawback of any such controller is that by symmetry, the 
feedback must vanish not only on 9 = n but also on 8 — 0; 
hence 9 = remains a fixed point of the controlled system 
and the system is not globally stable. We will show, however, 
that under certain conditions such feedback renders the system 
almost globally stable, in the sense that all paths that do not 
start on 9 = are attracted to 8 = tt a.s. 

For simplicity we choose a controller that is linear in (A, v): 

B(t) = 2GX t = Gsin£5 t , G > 0. (54) 
Here G is the feedback gain. The generator of d53l > is then 

if = (Gsinc? - IM sin 8 cos(9)— + iMsin 2 6>— . (55) 
z d9 * o9 l 

As a first step we will show that the fixed point 8 = ix is 
asymptotically stable and that the system is always attracted 
to one of the fixed points (there are no limit cycles etc.) To 
this end, consider the Lyapunov function 



V{8) = 1 + cos 8, 
We obtain 

jSfV(0) 



V{tt) = 0, V{8 ^ tt) > 0. (56) 



-Gsin 2 0. 



(57) 



It follows from Theorem^that 9 = tt is asymptotically stable, 
and from Theorem [2] that lim^oo t G {0,7r} a.s. 

What remains to be shown is that any trajectory which does 
not start on 9 = ends up at 9 = tt a.s. To prove this, consider 

V (6) =-log(l-cos0), lim V(9) = +oo. (58) 
We easily find 

jgfV(0) = cos 2 (0/2) (M + M cos — 2G). (59) 
Now note that 

&V{0) < V0 G (-7r,7r)\{0} iff G> M. (60) 
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Thus by Theorem [3] we have 
if 



sup|0f| 

i>0 



< 7T 



9 e (-7r,7r)\{0}. 



(61) 



But as 9 e S 1 this implies 9 t — ► tt a.s. if 8 e (— 7r, 7r)\{0}. 
We conclude that the control law 1541 almost globally stabi- 
lizes the system if we have sufficient gain G > M. 

E. Global control on S 1 

Any deterministic system on the circle is topologically 
obstructed 4 from having a globally stabilizing controller: a 
continuous vector field on S 1 with a stable fixed point neces- 
sarily has an unstable fixed point as well. In the stochastic case, 
however, this is not the case. Though the drift and diffusion 
terms must each have two fixed points, we may design the 
system in such a way that only the stable fixed points coincide. 

To apply such a trick in our system we must break the 
natural symmetry of the control law. This situation is shown 
in Fig. [3p. There is a region of the circle where the control 
rotates in the direction with a longer distance to 9 — it; the 
advantage is that 9 = is no longer a fixed point. 

The linear control law that has this property has the form 

B(t) = 2G\ t + 2Hv t = Gsin9 t + H(l + cos6» t ) (62) 

with G > 0. We can prove global stability by applying 
Theorems Q an d ID with a Lyapunov function of the form 



V{9) = (a + sin0)(l 



cost 



a > 1. 



(63) 



Unfortunately it is not obvious from the analytic form of j£f V 
how a must be chosen to satisfy the Lyapunov condition. It 
is however straightforward to plot jSf V, so that in this simple 
case it is not difficult to search for a by hand. 

A typical design for a particular choice of parameters is 
shown in Fig. [3p. The conditions of Theorems □ andEl are 
clearly satisfied, proving that the system is globally stable. 
Note that when the symmetry is broken we no longer need to 
fight the attraction of the undesired fixed point; hence there is 
no lower bound on G. In fact, in Fig. |3p we have G < M. 

F. Almost global control on B 2 

Unfortunately, the simple almost global control design on 
S 1 does not generalize to B 2 . The problem is illustrated in 
Fig. |5J;. The controller J54I vanishes at 9 = and n, but we 
can prove that 9 = is repelling. On B 2 , however, the control 
vanishes on the entire line A = which becomes an invariant 
set of (1521 . But then it follows from (1481 that any trajectory 
with Aq = 0, vq {0,1} has a nonzero probability of being 
attracted to either fixed point. 

Consider a neighborhood Uh of the point (X,v) = (0,1) 
that we wish to destabilize. For any h > 0, however small, Uh 
contains points on the line A = for which v < 1, and we have 
seen that trajectories starting at such points have a nonzero 
probability of being attracted to (0, 1). But this violates i5\i . 
so clearly we cannot prove Theorem [3] on B 2 . 

4 This is only the case for systems with continuous vector fields and 
continuous, pure state feedback. The obstruction can be lifted if one considers 
feedback laws that are discontinuous or that have explicit time dependence. 



One could attempt to prove that all points except those with 
A = are attracted to the origin with unit probability. The 
Lyapunov theory of section IIV-BI is not equipped to handle 
such a case, however, and new methods must be developed 
[50]. Instead, we will focus on the global control problem. 

G. Global control on B 2 and semialgebraic geometry 
Once again we consider the asymmetric control law 

B{t) = 2GX t + 2Hv t , G > (64) 

and try to show that it globally stabilizes the system. Before 
we can solve this problem, however, we must find a systematic 
method for proving global stability. Searching "by hand" for 
Lyapunov functions is clearly impractical in two dimensions, 
and is essentially impossible in higher dimensions where the 
state space cannot be visualized. 

In fact, even if we are given a Lyapunov function V, testing 
whether Jzf V < on K is highly nontrivial. The problem can 
be reduced to the following question: is the set {A 6 M™ _1 : 
££V > 0, k p (h(A)) > 0, p = 2 . . . n} empty? Such problems 
are notoriously difficult to solve and their solution is known 
to be NP-hard in general [51]. 

The following result, due to Putinar [52], suggests one way 
to proceed. Let S be a semialgebraic set, i.e. S = {x G M m : 
Si(x) > 0, i = 1 . . . n} with polynomial Sj. Suppose that for 
some i the set {x € K m : Si(x) > 0} is compact. Then any 
polynomial p that is strictly positive on S is of the form 

a 

p{x) =Po(x) +^2p t (x)si(x), p k (x) = ^Pkj{x) 2 (65) 

i=l j 

where p^j are polynomials; i.e., p is an affine combination of 
the constraints Sj and sum-of-squares polynomials 

Conversely, it is easy to check that any polynomial of 
the form \65\ is nonnegative on S. We may thus consider 
the following relaxation: instead of testing nonnegativity of a 
polynomial on S, we may test whether the polynomial can 
be represented in the form d65i . Though it is not true that 
any nonnegative polynomial on S can be represented in this 
form, Putinar's result suggests that the relaxation is not overly 
restrictive. The principal advantage of this approach is that 
the relaxed problem can be solved in polynomial time using 
semidefinite programming techniques [53], [54]. 

The approach is easily adapted to our situation as K is a 
semialgebraic set, and we solve the relaxed problem of testing 
whether — Jz?V can be expressed in the form ( I65> . In fact, the 
semidefinite programming approach of [53], [54] even allows 
us to search for polynomial V such that d65l > is satisfied; hence 
we can search numerically for a global stability proof using 
a computer program. Such searches are easily implemented 
using the Matlab toolbox SOSTOOLS [55]. 

A typical design for a particular choice of parameters is 
shown in Fig. |4] After fixing the parameters AI = 2, rj = \, 
and the control law B(t) = 4A t - v t , an SOSTOOLS search 
found the Lyapunov function 



V(A, v) = 21.8 v — 5.73 A 2 + 10.4 Xv - 5.63 1 



(66) 
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Fig. 4. Contour plot of SfV for the control law B(t) = 4At — vt, with 
M = 2 and r/ = ^ ■ The function V was found by semidefinite programming. 

where — _£f V is of the form ( I65> . Hence Theorems ^ an d ID 
are satisfied, proving that the system is globally stable. 

A couple of technical points should be made at this 
point. Note that formally the filtering equation (I38t and its 
parametrizations do not satisfy the linear growth condition. 
However, as the filter evolves on a compact invariant set K, 
we could modify the equations smoothly outside K to be of 
linear growth without affecting the dynamics in K. Hence the 
results of section HV-Bl can still be used. Moreover, it is also not 
strictly necessary that V be nonnegative, as adding a constant 
to V does not affect JiCV. Hence it is sufficient to search for 
polynomial V using SOSTOOLS. 

H. Global control for higher spin 

The approach for proving global stability described in the 
previous section works for arbitrary spin j. To generalize our 
control scheme we need to convert to the parametrization of 
section Ifll-BI as we did for spin j = ~ in (I52> . We must also 
propose a control law that works for general spin systems. 

We do not explicitly convert to the parametrized form 
or generate the constraints k p , as this procedure is easily 
automated using Matlab's symbolic toolbox. Note that the pa- 
rameter k determines which eigenstate is mapped to the origin. 
This is convenient for SOSTOOLS searches, as polynomials 
can be fixed to vanish at the origin simply by removing the 
constant term. We always wish to stabilize the origin in the 
parametrized coordinate system. 

To speed up computations we can eliminate all the parame- 
ters /iy as was done in going from (I43> to d52i . The fact that 
the remaining equations are decoupled from /i y is easily seen 
from Q8t . as both iJ y and J z are real matrices. Moreover 
it is easily verified that, by convexity of K, the orthogonal 
projection of any p G K onto {R™ _1 : faj — Vi > j} lies 
inside K. Hence we only need to consider the reduced control 
problem with /iy = 0. 

In [3] we numerically studied two control laws for general 
spin systems. The first law, Bi(t) — iit{JxJz + JzJx — 2mdJx) 
(rrid is the eigenstate we wish to stabilize), reduces to our 
almost global control law when j = I. However, numerical 
simulations suggest that for j > i this control law gives a 
finite collapse probability onto m ^ m^. The second law, 
B2(t) — ir t (J z ) — irid, reduces to B^if) = v t in the case 



j = i, which is not locally stable. Our experience with i = \ 
suggests that a control law of the form 

B(t) =GTT t {J x Jz+JzJx-1m d J x )+H{-K t (J z )-m d ) (67) 

should globally stabilize the eigenstate ma of a spin j system. 

We have verified global stability for a typical design with 
j = 1, M = 2, r, = i, and B(t) = 2ir t (J x J z + J z J x )+ir t {J z ) 
using SOSTOOLS. A Lyapunov function was indeed found 
that guarantees global stability of the eigenstate i/'oV'o- 

Physically the case j > \ is much more interesting than 
j = i. An experiment with j > \ can be performed with 
multiple atoms, in which case the control produces statistical 
correlations between the atoms. Such correlations, known as 
entanglement, are important in quantum computing. The struc- 
ture of the control problem is, however, essentially the same 
for any j. We refer to [3], [56] for details on entanglement 
generation in spin systems. 

V. Conclusion 

In this paper we have argued that quantum mechanical 
systems that are subjected to measurement are naturally treated 
within the framework of (albeit noncommutative) stochastic 
filtering theory. The quantum control problem is then reduced 
to a classical stochastic control problem for the filter. We have 
demonstrated the viability of this approach by stabilizing state 
reduction in simple quantum spin systems using techniques of 
stochastic nonlinear control theory. 

Unfortunately, the stabilization techniques of section IIVI 
have many drawbacks. We do not have a systematic procedure 
for finding control laws: we postulate linear controllers and 
search for corresponding Lyapunov functions. Even when the 
control law is known, verifying global stability is nontrivial 
even in the simplest case. Our numerical approach, though 
very successful in the examples we have shown, rapidly 
becomes intractable as the dimension of the Hilbert space 
grows. Finally, our methods do not allow us to make general 
statements; for example, though it seems plausible that the 
control law J67t is globally stabilizing for any j,nid, M,rj, 
H 7^ and G > 0, we have not yet succeeded in proving 
such a statement. 

Nonetheless we believe that the general approach outlined in 
this paper provides a useful framework for the control of quan- 
tum systems. It is important in this context to develop methods 
for the control of classical stochastic nonlinear systems [57]- 
[60], as well as methods that exploit the specific structure 
of quantum control problems. The design of realistic control 
systems will also require efficient signal processing algorithms 
for high-dimensional quantum filtering and methods for robust 
quantum control [61]. 
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